Evaluation of Tracheoesophageal Substitute Voices Using Prosodic Features

نویسندگان

  • Tino Haderlein
  • Elmar Nöth
  • Maria Schuster
  • Ulrich Eysholdt
  • Frank Rosanowski
چکیده

Tracheoesophageal (TE) speech is a possibility to restore the ability to speak after laryngectomy, i.e. after the removal of the larynx. TE speech often shows low audibility and intelligibility which makes it a challenge for the patients to communicate. In speech rehabilitation the patient’s voice quality has to be evaluated. As no objective classification means exists until now and an automation of this procedure is desirable, we performed initial experiments for automatic evaluation using prosodic features. Our reference were scoring results for several evaluation criteria for TE speech from five experienced raters. Correlation coefficients of up to 0.84 between human and automatic rating are promising for future work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic evaluation of tracheoesophageal substitute voice: sustained vowel versus standard text.

OBJECTIVE The Hoarseness Diagram, a program for voice quality analysis used in German-speaking countries, was compared with an automatic speech recognition system with a module for prosodic analysis. The latter computed prosodic features on the basis of a text recording. We examined whether voice analysis of sustained vowels and text analysis correlate in tracheoesophageal speakers. PATIENTS ...

متن کامل

Automatic evaluation of tracheoesophageal substitute voices

In 20 to 40 percent of all cases of laryngeal cancer, total laryngectomy has to be performed, i.e. the removal of the entire larynx. For the patient, this means the loss of the natural voice and thus the loss of the main means of communication. A popular method of voice restoration involves a shunt valve (“voice prosthesis”) between trachea and pharyngoesophageal segment which establishes the t...

متن کامل

Creating expressive synthetic voices by unsupervised clustering of audiobooks

In this work we design an approach for automatic feature selection and voice creation for expressive synthesis. Our approach is guided by two main goals: (1) increasing the flexibility of expressive voice creation and (2) overcoming the limitations of speaking styles in expressive synthesis. We define a novel set of features, combining traditionally used prosodic features with spectral features...

متن کامل

Prosodic and Spectral iVectors for Expressive Speech Synthesis

This work presents a study on the suitability of prosodic and acoustic features, with a special focus on i-vectors, in expressive speech analysis and synthesis. For each utterance of two different databases, a laboratory recorded emotional acted speech, and an audiobook, several prosodic and acoustic features are extracted. Among them, i-vectors are built not only on the MFCC base, but also on ...

متن کامل

How vulnerable are prosodic features to professional imitators?

Voice imitation is one of the potential threats to security systems that use automatic speaker recognition. Since prosodic features have been considered for state-of-the-art recognition systems in recent years, the question arises as to how vulnerable these features are to voice mimicking. In this study, two experiments are conducted for twelve individual features in order to determine how a pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008